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A non-equilibrium, generally time-dependent, environment whose form is deduced 
' by optimal learning control is shown to provide a means for incoherent manipulation 

o ■ 

' of quantum systems. Incoherent control by the environment (ICE) can serve to steer 

p) ^. a system from an initial state to a target state, either mixed or in some cases pure, by 

I exploiting dissipative dynamics. Implementing ICE with either incoherent radiation 

' or a gas as the control is explicitly considered, and the environmental control is 
characterized by its distribution function. Simulated learning control experiments are 

> . 

, performed with simple illustrations to find the shape of the optimal non-equilibrium 

0^ 

O ' distribution function that best affects the posed dynamical objectives. 

o> ' 

o 
\o 
o 



INTRODUCTION 



I The manipulation of coherent atomic and molecular dynamics often utilizes shaped elec- 

O^' tromagnetic fields as the control. This topic is the focus of extensive theoretical and exper- 
imental research which relies on tailoring constructive and destructive 

fXl ■ 

I interferences between different dynamical pathways of a quantum system. Many labora- 



tory implementations of quantum control with lasers use adaptive feedback learning tech- 
niques j8|. 

In the present paper we consider the manipulation of atomic and molecular dynamics 
with the control being a tailored non-equilibrium, and generally time-dependent, state of 
the surrounding environment. Different non-equilibrium states of the environment can in- 
duce correspondingly unique dynamical responses in the physical system being controlled. 
Incoherent control by the environment (ICE) is distinct from operations with coherent con- 
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trol. Control by ICE affects a system through dissipative dynamics and can be used to steer 
the system from a pure or a mixed state into mixed and in some cases pure states. Control by 
lasers normally affects the system through Hamiltonian evolution and transforms pure states 
into the pure states. In practice limitations will exist on laser controls and the capabilities 
of ICE as well. Thus, in the most general circumstance a shaped laser pulse and a tailored 
non-equilibrium environment could be combined into an overall tool to simultaneously best 
perform control by Hamiltonian and dissipative dynamics. 

This paper explicitly considers ICE implemented by two types of environments: incoher- 
ent radiation (i.e., a gas of photons) and gas of particles (i.e., electrons, atoms, or molecules). 
The particles of an environment (i.e., photons or matter) are characterized by their momenta 
k and internal degrees of freedom indexed by a. For photons a denotes polarization while 
for atoms or molecules it denotes their internal energy levels. The environment is described 
by the distribution of mean occupation numbers nk,a(^) of its microscopic states |k, a) at 
time t. Thermal equilibrium states are characterized by only a few parameters, such as 
temperature and chemical potential, which uniquely determine the corresponding equilib- 
rium distribution. Non-equilibrium states are characterized by occupation numbers of all 
microscopic states and have much richer structure. The control considered with ICE is the 
generally time dependent distribution function nk,a(i). The shape of this function (i.e., its 
dependence on k, a, and t) together with the interaction Hamiltonian determines the dy- 
namics of the system under control. Although operation with ICE does not induce coherent 
dynamics, nevertheless the shaping of nk,a(t) over k, a, and t provides considerable flexibil- 
ity for system manipulation. Many control objectives can be expressed in terms of creating 
specific mixed states of a system, which should be amenable to utilizing non-equilibrium en- 
vironmental states for their preparation. Certain pure states can also be reached with ICE. 
For example, incoherent radiation can steer a three-level A-atom from the ground state to 
the intermediate excited state, a gas can steer a two-level atom through collisions to the 
excited state, etc. 

The practical creation of controls for implementing ICE is important to consider. First, 
incoherent non-equilibrium radiation as a control includes monochromatic incoherent light, 
thermal radiation propagating in a medium with frequency sensitive absorbtion (i.e., filters), 
and radiation from luminescent emission. In the simplest case monochromatic radiation can 
be either coherent or incoherent, and in both circumstances it is composed of waves with 
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the same wavelength. In the second formulation of ICE, a non-equilibrium gas, or more 
generally a surrounding fluid or solid medium, of particles is described by its distribution over 
momenta and internal degrees of freedom (e.g., vibrational and rotational modes of molecules 
or atomic energy levels). A non-equilibrium state of a gas may be generated in a number of 
ways, including through a sudden change in its thermodynamic parameters. For example, 
if a lower energy level of a molecule relaxes to equilibrium faster than an upper level, then 
a sudden temperature drop can create a non-equilibrium state with population inversion. 
A non-equilibrium particle momentum distribution can be created using selective excitation 
of the internal degrees of freedom by a laser with subsequent coUisional transfer of the 
excitations into momentum modes. Lasers can have dual roles in this general circumstance 
of (a) directly addressing the system Hamiltonian for control as well as (b) first addressing 
the environment for its subsequent controlled manipulation of the system. 

The main impediments to designing coherent optimal control fields are the required de- 
tailed knowledge of the system Hamiltonian and the need to solve the generally high dimen- 
sional Schrodinger equation. The difficulties in handling these two issues inevitably lead to 
significant errors in the field designs which would likely result in ineffective control in labo- 
ratory experiments. To overcome these difficulties optimal learning control in the laboratory 
is proving to be very successful [8]. In this fashion the system subjected to control is used as 
an analog computer which solves its own Schrodinger equation exactly in real time and with 
the true laser field and system Hamiltonian. The capabilities of high duty cycle laser pulse 
shaping along with fast observation of the controlled outcome allows for efficient pattern 
recognition algorithms (e.g., genetic algorithms j^) to guide a sequence of experiments to 
home in on the specified system control objective. The same logic is proposed for ICE to find 
an optimal non-equilibrium environment as a control, either alone or possibly in conjunction 
with determining a coherent control field. 

This paper considers the following control problem: for a given target state of the 
system , find a distribution function of the environment r;,k,a(t) such that the corre- 
sponding induced non-unitary system dynamics is steered from some initial state pi as 
close as possible to the state . The corresponding objective function can be chosen as 
J[n\^,a{''^)] = E„m(P"m " pIm)T^^ whcrc p^m IS an element of the density matrix p = p(tf) 
at some final time tf evolved from pi under the action of the environment with distribution 
function nk,Q(t). The problem of creating an effective control based on significant environ- 
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mental interactions generally requires an adaptive learning control procedure. The advan- 
tage of learning control lies in its ability to find an optimal distribution ^^ ^(t) even if the 
details of the system, environment and their interactions are not known. In the laboratory, 
practical feedback signals would be of the form TrpO, where G is an observable operator, 
and the goal would be to steer Tr pO towards the desired value Tr p'^Q. In the more general 
case, expectation values of several possibly noncommutative observable operators G, could 
be used for feedback, and the goal would be to steer the expectation value of each Oj towards 
its desired value. Other observable properties of the system, the environment, or the means 
of generating its non-equilibrium state also can be incorporated into the objective function. 

Learning control driven by a GA is employed in the present paper in simulations of the 
potential effectiveness of ICE using either incoherent radiation or a gas as controls. In these 
cases there is a Markovian regime (i.e. , weak coupling and low density) for the reduced 



dynamics of the controlled system Il3l |. Master equations with appropriate 

dissipative terms are reasonable models in this regime. These equations are used here to 
simulate evolution of the system's density matrix towards the target under the influence of 
a non-equilibrium environment. The general ICE concept is not limited to the models used 
here, and in the laboratory fully non-Markovian dynamics naturally would be included as 
required. 

An environment prepared impulsively in a non-equilibrium state will evolve to equilib- 
rium. In the simulations below we consider control by stationary non-equilibrium environ- 
ments. In practice this requires keeping the environment in a non-equilibrium state for a 
time which is sufficiently long to perform the control. In the simulations the master equa- 
tions will be followed temporally until a steady state is reached for the control outcome. 
A stationary non-equilibrium state for incoherent radiation can easily be maintained using 
steady sources. In addition, a non-equilibrium state of a stationary gas can be maintained 
by controlling the gas with a suitable external field. An example of such control is prepara- 
tion of population inversion in the He-Ne gas-discharge laser. In this case we may consider 
the gas of He atoms as the control environment and the Ne atoms as the controlled sys- 
tem. An electric discharge is passed through the He-Ne gas to bring the He atoms into 
a non-equilibrium state. Then He-Ne collisions transfer the energy of the non-equilibrium 
state of the He atoms into the high energy levels of the Ne atoms. This process creates a 
population inversion in the Ne atoms and subsequent lasing. A steady electric discharge 
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can be used to keep the gas of helium atoms in a non-equihbrium state to produce a CW 
He-Ne laser. In the present simulations the actual nature of the external control creating the 
non-equilibrium steady distribution nk will not be explicitly described. Rather in keeping 
with exploring the ICE concept, nk will be treated directly as the control for optimization 
as a function of k. In the laboratory the external control settings would be guided by the 
learning algorithm using the system observations as a feedback signal. 

Control by several lasers with incoherent relative phases was considered in In this 
approach the interference between different channels is manipulated by changing the relative 
frequencies and intensities of the lasers. Although the relative phases between the lasers are 
incoherent, each of the lasers is a source of coherent radiation which affects the system 
through induced Hamiltonian dynamics. The present paper, in its part devoted to control 
by radiation, uses as the control totally incoherent radiation which affects the system through 
induced dissipative evolution. 

There is a relation between ICE and recentlv proposed measurement-assisted and 
dual material-photonic reagent control [16]. In all these situations control is implemented, at 
least partially, through induced dissipative dynamics. The ICE proposal directly exploits the 
influence of an optimally shaped environmental distribution function upon the dynamics of 
the controlled system. Measurement-assisted control exploits the back action of decoherence 
induced by measurements. Material-photonic reagent control utilizes dynamics driven by a 
laser as well as identification of an optimal material for the system and/or the environment. 
All these approaches to molecular or condensed phase system manipulation can be considered 
as different branches of the general perspective of introducing control through same aspects 
of dissipative dynamics. 

Sections II and III present simple illustrations of ICE using incoherent radiation and a 
gaseous medium, respectively. The formulations will be developed in the general context 
of a coherent field being present for directly addressing the system along with a time- 
dependent environmental control distribution function nk(t). The test simulations will be 
performed with steady distributions and without a coherent control field to serve as a simple 
demonstration of the basic closed-loop ICE concept. Even richer control should be possible 
with temporal control densities operating along with a coherent field e{t). Brief concluding 
remarks are given in Section IV. 
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II. INCOHERENT RADIATION SERVING AS A CONTROL 

This section considers control by non-equilibrium incoherent radiation described by a 
distribution nk of the photon momenta. In general, the polarization dependence of the 
incoherent radiation also can be exploited as an additional control along with the propagation 
direction in cases where spatial anisotropy is important (e.g., a system consisting of oriented 
molecules bound to a surface). 

The thermal (i.e., equilibrium) distribution of photons at temperature T and frequency 
uo is determined by Planck's formula A''^^ = ct;^/7r^c^[exp (/icu/^Br) — 1], where c is the 
speed of light, fee and h are the Boltzmann and the Planck constants (we set them to 1 
in the sequel). Non-equilibrium incoherent radiation may have an arbitrary distribution. 
Fig. Ogives an example of a thermal distribution and a non-equilibrium distribution, which 
corresponds to incoherent radiation composed of three nearly monochromatic components 
of differing intensity. The latter distribution may be produced either by filtering the 
thermal radiation or by using independent monochromatic sources. 




Frequency, w 



FIG. 1: The Planck density of black body radiation (upper curve) and the density (un-normalized) 
of non-equilibrium radiation composed of three nearly monochromatic sources (lower curve). 

The master equation for a system simultaneously interacting with a coherent electromag- 
netic field e{t) and an environment with distribution nk(t) generally has the form: 

^ = -i[Ho + - ^le{t),p{t)\ + £dissK(t); p(t)]. (1) 

The coherent dynamics is generated by the system Hamiltonian Hq = EnPn with eigen- 
values En and the corresponding projectors P„, the effective Hamiltonian ifefr originating 
from system-environment interaction, dipole moment /i, and electromagnetic field e{t). The 
effective Hamiltonian commutes with Hq and is not used in the following simulations. 
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The dissipative dynamics generated by the incoherent radiation distribution function 
nk(t) has the form Cdiss[nk{t); p] = Cl-^^[nk{t)] p] = J^ll^it) + 7l..W](2/ic.P/^+ - P^Pujp - 
pp^p^) where the sum is taken over all system transition frequencies. Here the coefficients 
7^(t) = TT f dk6{\k\ — u)\g(k)\'^[n\^(t) + (1 ± l)/2] determine the transition rates between 
energy levels with transition frequency uj and Puj = Yl PmpPn- The transition rates 
are the functions of the photon density nk(t) and matrix elements of the dipole moment 
p. The form-factor g{k) determines the coupling of the system to the k-th mode of the 
field. If for all k, nk = 0, (i.e., the quantum system is in a vacuum), then = 0, and the 
coefficients 7^ together with dipole moment p determine the inverse lifetime of the system's 
energy levels. The positivity of 7^(t) guarantees that the off-diagonal elements of the density 
matrix vanish at sufficiently long time. 

As a specific example, numerical simulations are performed for control of a four-level 
system by means of incoherent radiation with the distribution in terms of the magnitude 
of the photon momenta = |k|; no direct coherent control is present such that e{t) = 0. 
The system has the free Hamiltonian Hq = diag{0; 11; 13; 24} and dipole moment matrix 

/ 0.8 0.3 0.5 \ 
0.8 0.2 0.7 
0.3 0.2 1 
\0.5 0.7 1 / 

All transitions amongst the energy levels are allowed, and the system is assumed to initially 
be in its ground state. The goal of the control effort is to steer the system to a target 
mixed state p^ (we consider several examples for p"""). The corresponding objective function 
is chosen to have the form J[nk] = [J2nmipnm — Pnm)'^Y^'^ where pnm and p'^^ are elements 
of the system's density matrix and the target density matrix, respectively. The goal is to 
minimize the objective function at a sufficiently long time such that p is stationary. 

Each distribution function nk determines the quantum dynamical semigroup P^^^ = e*^"k , 
t > with generator C^^^l - ] := —i[HQ, ■] + £|j;gg[nk; ■]. An invariant state of the semigroup 
Pinv is defined by £„^[pinv] = (thus P^^{pi^^) = pinv)- The solution of Eq. ((H) with initial 
condition p(t = 0) = po has the form p(t) = P^^[pq]. If the coefficients 7^ are nonzero then 
the system density matrix p{t) will converge at long time to pinv For a given distribution 
function rzk one can compute the corresponding Lindblad operator and find its invariant. 
Here we consider the inverse problem: given a target state p"^ find a distribution function of 
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FIG. 2: Results of ICE simulations with tailored incoherent radiation as the control for tar- 
get states (a) = diag(0.3; 0.3; 0.2; 0.2), (b) = diag(0.3; 0.2; 0.3; 0.2), and (c) = 
diag(0.4; 0.1; 0.4; 0.1). Each case shows: the objective function vs GA generation, the optimal 
spectral distribution vs frequency, and the evolution of the diagonal matrix elements of the density 
matrix for the optimal distribution. In the plots for the objective function the upper curve is the 
average value for the objective function and the lower one is the best value in each generation. 



the environment which generates the Lindblad operator whose invariant state is as 
close as possible to p'^. In the case of control by radiation only values of rik at the system 
transition frequencies generally have an effect on the dynamics. In principle, these values 
of nk could be calculated if there was full knowledge of the system, environment and their 
interaction. This circumstance is rarely the case, and for a gaseous medium the situation 
is even more complex, since generally all modes of the gas contribute to the dynamics [see 
Eq. Even with all of these uncertain conditions learning control can be effective because 
it only relies on laboratory control-response observations. 
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The non-equilibrium radiation distribution function is modelled as = 
exp(— exp[— (A; — ki)"^ /2Di\/ y/2TxDi with /3 = 1/20. The parameters Di and ki 
are optimized over ranges large enough to include all of the system transition frequencies. 
A set of these parameters forms an individual in the population for the GA to operate with. 
Each individual determines a distribution function rtk which is used to drive the evolution 
of the system density matrix p{t) towards its stationary form as t — > oo to ultimately de- 
termine the objective function. In practice, the time t only needs to be taken out to some 
small multiple of the longest timescale of the system transitions. 

A GA with crossover and mutation operators is used to find the optimal values for ki 
and Di. The number of individuals in each population is 14. The total of 20 variables 
form an individual corresponding to ten parameters ki and Di. Each variable is coded 
into a string of 20 bits. The two most fit individuals are always retained in the successive 
generation. The remaining twelve individuals are produced from the parent population 
using the crossover and mutation operators. The fitness function determines the number 
of times each individual from the parent generation is chosen to produce offsprings in the 
next generation. The probability of crossover is Px = 0.9 and of mutation (a bit flip) is 
Pm = 0.7/Lind, where L^^^ = 400 is the length of the bit string forming an individual. 

The results of the simulations for three different target states are presented in Fig. El Each 
case contains plots of the objective function vs generation, the optimal spectral distribution 
function vs frequency, and the corresponding evolution of the system density matrix. In 
each case the target objective is met very well with a suitable spectral distribution function. 
The fitness function in cases (a) and (b) has small values for the initial population because 
most randomly chosen distribution functions induce states of the system which are close 
to the equally populated state. Hence, the search for a distribution function which steers 
the system to a complex target state is a non-trivial problem. A general expectation is that 
certain modes of radiation will promote transitions to the target state whereas others may 
be harmful for control. Thus, it is found that distinct distributions of radiation energy are 
required to most effectively steer the system into each particular target state. Each of the 
radiation distribution functions has components at all of the system transition frequencies, 
and the mechanism of the control is not simply evident in each case. Further analysis [17| 
would be needed to identify the control mechanism. In keeping with the logic of the adaptive 
control technique, the learning algorithm deduces how to distribute the radiation energy 
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without specific knowledge or details of the microscopic dynamics, including the Hamiltonian 
and initial state of the system, as well as details of its coupling to the non-equilibrium 
environment. Fig. El shows that excellent results can be achieved even with diverse choices 
for the target density matrix. 

Using incoherent radiation as the control clearly has some limitations. For example, 
incoherent radiation can not create population inversion in a two-level atom. However 
incoherent radiation can steer a quantum system from a pure state to a mixed one, possibly 
of a complex structure as indicated in Fig. |21 and in some cases specific pure states can also 
be reached. Tailored incoherent radiation also can be used jointly with a laser field e{t) to 
improve the degree of system control when significant laser restrictions exist, such as bounds 
on laser intensity or bandwidth. 



III. A GASEOUS MEDIUM SERVING AS A CONTROL 

This section considers ICE with a non-equilibrium density r2k(i) of gas particles such as 
electrons, atoms or molecules serving as the control. Quantum systems interacting with such 
gases are described by master equations whose dissipative generators are different from C^^^^ 
in Section 11. The gas is assumed to be sufficiently dilute such that the reduced dynamics 
of the system is Markovian. In this case the probability of simultaneous interaction of 
the system with two or more particles of the gas is negligible and the reduced dynamics 
is determined by two body scattering events between one particle of the system and one 
particle of the gas. The assumption of the rarity of the gas is not a restriction for ICE, and 
dense gases might be used for control as well. 

The master equation for a system interacting with a coherent electromagnetic field e{t) 
and a gas has the form of Eq. ((H) with the dissipative generator /^dissi'T-kl^); p] = '^diss[^k(i); p] 
specified by the distribution function of the gas nk(t) and by the T-operator (transition 
matrix) for scattering of the system and a gas particle. A transition matrix element is 
T„^„/(k, k') = {n,k\T\n' ,]<.') , where |n, k) = |n)|k) denotes the product state of the system 
discrete eigenstate \n) (an eigenstate of the system's free Hamiltonian Hq with eigenvalue 
En) and a translational state of the system and a gas particle with relative momentum k. 
If the system is fixed in space (we consider this case below corresponding to the system 
particle being much more massive than the particles of the surrounding gas) then |k) is a 
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translation state of a gas particle. The general case of relative system gas particle motion 
can be considered as well using suitable master equations. We assume that the particles 
of the gas are characterized only by their momenta and do not have internal degrees of 
freedom; otherwise, the state of one particle of the gas should have the form |k, a), where 
a specifies the state of the internal degrees of freedom. It is convenient to introduce the 
notation T(^(k, k') := ^ Tm,n(k, k')l"^)('^l- The density of particles of the gas at 

m,n: em—e-n=^ 

momentum k is nk(t), the kinetic energy of a gas particle of mass M is |k|V2M, and B 
is the set of transition frequencies lo of the system among the energy levels of ifo- In this 
notation the dissipation generator is 

4:;M^);p] = 2vr$^ / dkrfk'5(^-^ + ^)Mi)[T.(k',k)pT+(k',k) 

(T+(k', k)T^(k', k)p + pT+(k', k)T^(k', k))] (3) 

If the gas is at equilibrium with inverse temperature /5, then the density is stationary and 
has the Boltzmann form n\Sf) = n-^ = C(/5, n) exp(— /3|kp/2M), where the normaliza- 
tion constant C{j3,n) is determined by the condition J rfkn^ = n, where n is the total 
density of the gas. The structure of Eq. (jS)) for equilibrium gases has been discussed 
previously in Q| and for non-equilibrium stationary gases in Non-equilibrium 
gases may be characterized by generally time dependent distributions. Equation (0) with 
'Cdiss['^k(i'^); pit)] = 'Cdi^s[^k(i); pit)] is the general formulation for control by both a coherent 
electromagnetic field eit) and a non-equilibrium gas density Tik(^)- As a simple illustration 
of ICE we only consider a simulation for control by a static non-equilibrium distribution n^. 

Let Ei^n = |kp/2M + En be the initial energy of the total system consisting of one gas 
particle and the controlled system before collision and let -E/,m = |k'p/2M + £m be the final 
energy after the collision. If for each transition frequency u there are only two system levels 
n and m such that u = En — £m,, then Eq. for the diagonal elements of the density matrix 
reduces to the Pauli master equation 

-^-^^ = 2 ^[wjnPnn(^) " Wnlpuit)] 
n 

where the transition probability Wmn = ^/ rfknk(t) / (ik'5(i?j„i — i?j^„)|Tm„(k', k)p between 
levels n and m is explicitly determined by the distribution function nk(t). 

The quantity / dk'diEf^rn — I ^mn(k', k)p defines the scattering cross section between 
the system and one gas particle. There are two possible strategies for investigating the 
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FIG. 3: Results of ICE simulations with a surrounding non-equilibrium gas as the control for 
target states (a) = diag(0.3; 0.3; 0.2; 0.2), (b) p'^ = diag(0.3; 0.2; 0.3; 0.2), and (c) p^ = 
diag(0.4; 0.1; 0.4; 0.1). Each case shows: the objective function vs GA generation, the optimal 
distribution vs momentum, and the evolution of the diagonal elements of the density matrix for 
the optimal distribution. In the plots for the objective function the upper curve is the average 
value for the objective function and the lower one is the best value in each generation. 



prospects for control by a non-equilibrium gas. First, one may start with the microscopic 
interaction Hamiltonian between the system and a particle of the gas to compute the T- 
matrix for use in the dissipative generator (j2)). The second strategy is to use experimentally 
measured cross sections for the same purpose. As our purpose here is to illustrate the 
prospect of closed-loop laboratory learning control with ICE, we will simply use the first 
option in a model. 

In the simulations we consider a four-level system with the same free Hamiltonian Ho 
in Section II, immersed in a dilute gas. The system is initially prepared in its ground 
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state and the goal of the control is to steer the system to a target state (we consider 
the same target states and the same fitness function as in Section II). The interaction V 
between the system and one particle of the gas is considered to be weak with matrix elements 
V^m(k, k') = (n, k| V|m, k') = yU„mfl'n(k)(7m(k') defined by the matrix /i of the form and by 
the functions (7n(k) chosen as characteristic functions of the momentum magnitude k = |k|, 
(7„(k) = X[a„,fe„](^)- Here X[a„,fe„](^) has unit value if a„ < < 6„ and zero otherwise. The 
parameters a„ and 6„ are chosen randomly as ai = 2, bi = 12, a2 = 9, 62 = 24, = 3, 
63 = 17, 04 = 14, and 64 = 26. We chose M = 1. 

The matrix /i describes transitions between the system's energy levels due to interaction 
with the gas and the functions 5'n(k) describe the corresponding change in the momentum 
of a gas particle. For a general fi its diagonal elements are responsible for elastic scattering 
of the system (these elements are zero in our case), whereas the off-diagonal ones control 
the inelastic events. Spontaneous emission from the upper levels is assumed to be negligible, 
corresponding to the lifetimes of the excited states being much longer than the inverse 
transition rates due to collisions with the gas. Elastic scattering and spontaneous emission 
can be included when necessary, and all physical processes could naturally be present in a 
laboratory closed loop experiment. 

The weak nature of the interaction allows for replacing the T-operator in Eq. Q by the 
interaction Hamiltonian V. The control in the simulations is a static distribution of the 
form rik = exp(— /5fc^) exp[— (A; — kiY /2Di\/ ^fhiTTi with (3 = 0.01. The parameters 
ki and Di are optimally determined by the GA. This distribution together with interaction 
Hamiltonian determines the dissipative generator C'^^^^ according to Eq. and the evolution 
of the system density matrix according to the master equation (P) (with e{t) = in this 
case). The goal of the control is to find a stationary non-equilibrium state of the gas which 
steers the system to a target state p^. If desired, either the constraint of a fixed total energy 
of the gas or its cost for minimizing its value could be included by adding appropriate terms 
to the objective function. Fig. IHl gives the results of the numerical simulation for different 
target states. The simulations show that diverse mixed states can be reached very well 
by manipulating the momentum distribution function ra^ using ICE based on a learning 
algorithm. 
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IV. CONCLUSIONS 

In this paper learning control with a non-equilibrium environment is proposed as a means 
for manipulating quantum systems. Two cases are simulated: control by incoherent radiation 
and by a gas of particles. The control is the distribution of mean occupation numbers 
of the environment. The control affects the physical system through tailored dissipative 
dynamics and allows for steering an initial pure or a mixed state into a complex target 
state. The search for an optimal control distribution in ICE is performed by a learning 
control strategy, which could be implemented in the laboratory without detailed knowledge 
of the system Hamiltonian, coupling to the environment, etc. The method can be generalized 
by combining standard laser control and the proposed ICE control. In the latter case a most 
interesting situation would be attaining states which can not be obtained by using either 
restricted lasers or a non-equilibrium environment alone. An open issue is to establish the 
degree of control, i.e., the set of states reachable with ICE. 
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